A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection

Abstract

Humans tend to mine objects by learning from a group of images or several frames of video, since we live in a dynamic world. In the computer vision area, many researchers focus on co-segmentation (CoS), co-saliency detection (CoSD) and video salient object detection (VSOD) to discover co-occurrent objects. However, previous approaches design different networks for these similar tasks separately, and they are difficult to apply to each other. Besides, they fail to take full advantage of the cues among inter- and intra-features within a group of images. In this paper, we introduce a unified framework to tackle these issues from a unified view, termed UFGS (Unified Framework for Group-based Segmentation). Specifically, we first introduce a transformer block, which views the image feature as a patch token and then captures their long-range dependencies through the self-attention mechanism. This can help the network excavate the patch-structured similarities among the relevant objects. Furthermore, we propose an intra-MLP module to produce a self-mask that enhances the network and avoids partial activation. Extensive experiments on four CoS benchmarks (PASCAL, iCoseg, Internet and MSRC), three CoSD benchmarks (Cosal2015, CoSOD3k and CocA) and five VSOD benchmarks (DAVIS$_{16}$, FBMS, ViSal, SegV2 and DAVSOD) show that our method outperforms other state-of-the-art methods in both accuracy and speed using the same network architecture, and can reach 140 FPS in real time. Code is available at https://github.com/suyukun666/UFO
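The self-attention step mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The function name `group_self_attention`, the projection matrices `w_q`, `w_k`, `w_v`, and the tokenization are assumptions made here for illustration: every spatial location of every image in the group is treated as one token, which may differ from how the paper forms its patch tokens. The sketch only shows how attention over tokens pooled from a whole group lets each location draw on similar locations in the other images.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def group_self_attention(features, w_q, w_k, w_v):
    """Single-head self-attention over tokens pooled from a group of images.

    features: array of shape (N, C, H, W), one feature map per image.
    w_q, w_k, w_v: hypothetical projection matrices of shape (C, D).
    Returns the attended features with shape (N, D, H, W) and the
    attention matrix with shape (N*H*W, N*H*W).
    """
    n, c, h, w = features.shape
    # Assumption: one token per spatial location, pooled across the whole group.
    tokens = features.transpose(0, 2, 3, 1).reshape(n * h * w, c)
    q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v
    # Scaled dot-product scores between every pair of tokens in the group.
    scores = q @ k.T / np.sqrt(q.shape[-1])
    attn = softmax(scores, axis=-1)
    out = attn @ v
    d = out.shape[-1]
    return out.reshape(n, h, w, d).transpose(0, 3, 1, 2), attn


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feats = rng.standard_normal((3, 8, 4, 4))  # a group of 3 feature maps
    w_q, w_k, w_v = (rng.standard_normal((8, 8)) for _ in range(3))
    out, attn = group_self_attention(feats, w_q, w_k, w_v)
    print(out.shape, attn.shape)
```

Because the attention matrix spans all tokens of the group, each row mixes information from every image, which is one simple way to model the inter-image cues the abstract refers to.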

Similar resources

Saliency Detection by Selective Strategy for Salient Object Segmentation

Saliency detection is useful for many computer vision tasks including content-based image retrieval, segmentation, and object detection. However, methods on saliency detection are usually greatly affected by factors like features and segmentation results. We propose a novel selective segmentation-based saliency detection model to decrease the side effects caused by these factors. After extracti...

Salient Object Detection and Segmentation

Automatic estimation of salient object regions across images, without any prior assumption or knowledge of the contents of the corresponding scenes, enhances many computer vision and computer graphics applications. We introduce a regional contrast based salient object extraction algorithm, which simultaneously evaluates global contrast differences and spatial weighted coherence scores. The prop...

Efficient Co-Salient Video Object Detection Based on Preattentive Processing

Automatic video annotation is a critical step for content-based video retrieval and browsing. Automatically detecting the focus of interest, such as co-occurring objects in video frames, can ease the tedious manual labeling process. However, detecting co-occurring objects that are visually salient in video sequences is a challenging task. In this paper, in order to detect co-salient video ob...

Three Birds One Stone: A Unified Framework for Salient Object Segmentation, Edge Detection and Skeleton Extraction

In this paper, we aim at solving pixel-wise binary problems, including salient object segmentation, skeleton extraction, and edge detection, by introducing a unified architecture. Previous works have proposed tailored methods for solving each of the three tasks independently. Here, we show that these tasks share some similarities that can be exploited for developing a unified framework. In part...

Temporally Object-based Video Co-Segmentation

In this paper, we propose an unsupervised video object co-segmentation framework based on primary object proposals to extract the common foreground object(s) from a given video set. In addition to the objectness attributes and motion coherence, our framework exploits the temporal consistency of the object-like regions between adjacent frames to enrich the set of original object proposals. We ...

Journal

Journal title: IEEE Transactions on Multimedia

Year: 2023

ISSN: 1520-9210, 1941-0077

DOI: https://doi.org/10.1109/tmm.2023.3264883